Quality Assurance of Bioinformatics Software: A Case Study of Testing a Biomedical Text Processing Tool Using Metamorphic Testing

Authors

  • Madhusudan Srinivasan
  • Morteza Pourreza Shahri
  • Indika Kahanda
  • Upulee Kanewala
Abstract

Bioinformatics software plays a very important role in making critical decisions within many areas including medicine and health care. However, most of the research is directed towards developing tools, and little time and effort is spent on testing the software to assure its quality. In testing, a test oracle is used to determine whether a test is passed or failed during testing, and unfortunately, for much of bioinformatics software, the exact expected outcomes are not well defined. Thus, the main challenge associated with conducting systematic testing on bioinformatics software is the oracle problem. Metamorphic testing (MT) is a technique used to test programs that face the oracle problem. MT uses metamorphic relations (MRs) to determine whether a test has passed or failed and specifies how the output should change according to a specific change made to the input. In this work, we use MT to test LingPipe, a tool for processing text using computational linguistics, often used in bioinformatics for bio-entity recognition from biomedical literature. First, we identify a set of MRs for testing any bio-entity recognition program. Then we develop a set of test cases that can be used to test LingPipe’s bio-entity recognition functionality using these MRs. To evaluate the effectiveness of this testing process, we automatically generate a set of faulty versions of LingPipe. According to our analysis of the experimental results, we observe that our MRs can detect the majority of these faulty versions, which shows the utility of this testing technique for quality assurance of bioinformatics software.
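To make the idea of a metamorphic relation concrete, here is a minimal Python sketch. It does not use LingPipe and does not reproduce the MRs from the paper, which the abstract does not list. The toy recognizer `recognize`, its dictionary `KNOWN_ENTITIES`, the seeded fault in `faulty_recognize`, and the concatenation relation are all hypothetical, chosen only to show how an MR can flag a faulty version without knowing the exact expected output.

```python
import re

# Hypothetical toy dictionary; stands in for a real bio-entity recognizer.
KNOWN_ENTITIES = {"p53", "brca1", "insulin"}


def recognize(text):
    """Toy recognizer: return the set of known entity names found in text."""
    tokens = re.findall(r"[A-Za-z0-9]+", text.lower())
    return {t for t in tokens if t in KNOWN_ENTITIES}


def faulty_recognize(text):
    """Seeded fault (illustrative): only the first five tokens are inspected."""
    tokens = re.findall(r"[A-Za-z0-9]+", text.lower())[:5]
    return {t for t in tokens if t in KNOWN_ENTITIES}


def mr_concatenation_holds(recognizer, text_a, text_b):
    """Illustrative MR: entities(a + b) should equal entities(a) | entities(b).

    No oracle is needed: the follow-up output is compared only against
    the outputs of the two source test cases.
    """
    source_a = recognizer(text_a)
    source_b = recognizer(text_b)
    follow_up = recognizer(text_a + " " + text_b)
    return follow_up == source_a | source_b


text_a = "Mutations in BRCA1 are linked to cancer."
text_b = "The p53 protein regulates the cell cycle."

print(mr_concatenation_holds(recognize, text_a, text_b))
print(mr_concatenation_holds(faulty_recognize, text_a, text_b))
```

The relation holds for the toy recognizer and is violated by the seeded faulty version, which mirrors, at a very small scale, the evaluation approach the abstract describes: generate faulty versions and check how many of them the MRs expose.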

Similar resources

Automatic Detection of Defects in Applications without Test Oracles

In application domains that do not have a test oracle, such as machine learning and scientific computing, quality assurance is a challenge because it is difficult or impossible to know in advance what the correct output should be for general input. Previously, metamorphic testing has been shown to be a simple yet effective technique in detecting defects, even without an oracle. In metamorphic t...

Chapter 16: Text Mining for Translational Bioinformatics

Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research-translating basic science results ...

MT4WS: an automated metamorphic testing system for web services

The use of web services has been growing significantly, with increasingly large numbers of applications being implemented through the web. A difficulty associated with this development is the quality assurance of these services, specifically the challenges encountered when testing the applications – amongst other things, testers may not have access to the source code, and the correctness of the...

Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

Publication date: 2018